NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

An LLM-Based Framework for Simulating, Classifying, and Correcting Students' Programming Knowledge with the SOLO Taxonomy

https://doi.org/10.1145/3641555.3705125

Zhang, Shan; Meshram, Pragati Shuddhodhan; Ganapathy_Prasad, Priyadharshini; Israel, Maya; Bhat, Suma (February 2025, ACM)

Novice programmers often face challenges in designing computational artifacts and fixing code errors, which can lead to task abandonment and over-reliance on external support. While research has explored effective meta-cognitive strategies to scaffold novice programmers' learning, it is essential to first understand and assess students' conceptual, procedural, and strategic/conditional programming knowledge at scale. To address this issue, we propose a three-model framework that leverages Large Language Models (LLMs) to simulate, classify, and correct student responses to programming questions based on the SOLO Taxonomy. The SOLO Taxonomy provides a structured approach for categorizing student understanding into four levels: Pre-structural, Uni-structural, Multi-structural, and Relational. Our results showed that GPT-4o achieved high accuracy in generating and classifying responses for the Relational category, with moderate accuracy in the Uni-structural and Pre-structural categories, but struggled with the Multi-structural category. The model successfully corrected responses to the Relational level. Although further refinement is needed, these findings suggest that LLMs hold significant potential for supporting computer science education by assessing programming knowledge and guiding students toward deeper cognitive engagement.
more » « less
Full Text Available
Enhancing Language Models with Idiomatic Reasoning

Zhou, Jianing; Zeng, Ziheng; Gong, Hongyu; Bhat, Suma (October 2024, Conference on Language Modeling)

Full Text Available
CLASP: Cross-modal Alignment Using Pre-trained Unimodal Models

Zhou, Jianing; Zeng, Ziheng; Gong, Hongyu; Bhat, Suma (August 2024, Association for Computational Linguistics)

Full Text Available
No Context Needed: Contextual Quandary In Idiomatic Reasoning With Pre-Trained Language Models

https://doi.org/10.18653/v1/2024.naacl-long.272

Cheng, Kellen; Bhat, Suma (January 2024, Association for Computational Linguistics)

Full Text Available
Unified Representation for Non-compositional and Compositional Expressions

https://doi.org/10.18653/v1/2023.findings-emnlp.783

Zeng, Ziheng; Bhat, Suma (January 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)

Accurate processing of non-compositional language relies on generating good representations for such expressions. In this work, we study the representation of language non-compositionality by proposing a language model, PIER+, that builds on BART and can create semantically meaningful and contextually appropriate representations for English potentially idiomatic expressions (PIEs). PIEs are characterized by their non-compositionality and contextual ambiguity in their literal and idiomatic interpretations. Via intrinsic evaluation on embedding quality and extrinsic evaluation on PIE processing and NLU tasks, we show that representations generated by PIER+ result in 33% higher homogeneity score for embedding clustering than BART, whereas 3.12% and 3.29% gains in accuracy and sequence accuracy for PIE sense classification and span detection compared to the state-of-the-art IE representation model, GIEA. These gains are achieved without sacrificing PIER+’s performance on NLU tasks (+/- 1% accuracy) compared to BART.
more » « less
Full Text Available
Non-compositional Expression Generation Based on Curriculum Learning and Continual Learning

https://doi.org/10.18653/v1/2023.findings-emnlp.286

Zhou, Jianing; Zeng, Ziheng; Gong, Hongyu; Bhat, Suma (January 2023, Findings of the Association for Computational Linguistics: EMNLP 2023)

Non-compositional expressions, by virtue of their non-compositionality, are a classic ‘pain in the neck’ for NLP systems. Different from the general language modeling and generation tasks that are primarily compositional, generating non-compositional expressions is more challenging for current neural models, including large pre-trained language models. The main reasons are 1) their non-compositionality, and 2) the limited data resources. Therefore, to make the best use of available data for modeling non-compositionality, we propose a dynamic curriculum learning framework, which learns training examples from easy ones to harder ones thus optimizing the learning step by step but suffers from the forgetting problem. To alleviate the forgetting problem brought by the arrangement of training examples, we also apply a continual learning method into our curriculum learning framework. Our proposed method combined curriculum and continual learning, to gradually improve the model’s performance on the task of non-compositional expression generation. Experiments on idiomatic expression generation and metaphor generation affirm the effectiveness of our proposed curriculum learning framework and the application of continual learning. Our codes are available at https://github.com/zhjjn/CL2Gen.git.
more » « less
Full Text Available
IEKG: A Commonsense Knowledge Graph for Idiomatic Expressions

https://doi.org/10.18653/v1/2023.emnlp-main.881

Zeng, Ziheng; Cheng, Kellen; Nanniyur, Srihari; Zhou, Jianing; Bhat, Suma (January 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Idiomatic expression (IE) processing and comprehension have challenged pre-trained language models (PTLMs) because their meanings are non-compositional. Unlike prior works that enable IE comprehension through fine-tuning PTLMs with sentences containing IEs, in this work, we construct IEKG, a commonsense knowledge graph for figurative interpretations of IEs. This extends the established ATOMIC2020 graph, converting PTLMs into knowledge models (KMs) that encode and infer commonsense knowledge related to IE use. Experiments show that various PTLMs can be converted into KMs with IEKG. We verify the quality of IEKG and the ability of the trained KMs with automatic and human evaluation. Through applications in natural language understanding, we show that a PTLM injected with knowledge from IEKG exhibits improved IE comprehension ability and can generalize to IEs unseen during training.
more » « less
Full Text Available
Generate, Prune, Select: A Pipeline for Counterspeech Generation \\against Online Hate Speech

Zhu, Wanzheng.; Bhat, Suma (January 2021, Findings of the Association for Computational Linguistics)
null (Ed.)
Countermeasures to effectively fight the ever increasing hate speech online without blocking freedom of speech is of great social interest. Natural Language Generation (NLG), is uniquely capable of developing scalable solutions. However, off-the-shelf NLG methods are primarily sequence-to-sequence neural models and they are limited in that they generate commonplace, repetitive and safe responses regardless of the hate speech (\eg, ``Please refrain from using such language.") or irrelevant responses, making them ineffective for de-escalating hateful conversations. In this paper, we design a three-module pipeline approach to effectively improve the diversity} and relevance. Our proposed pipeline first generates various counterspeech candidates by a generative model to promote \textit{diversity}, then filters the ungrammatical ones using a BERT model, and finally selects the most \textit{relevant} counterspeech response using a novel retrieval-based method. Extensive Experiments on three representative datasets demonstrate the efficacy of our approach in generating diverse and relevant counterspeech.
more » « less
Full Text Available
Abusive Language Detection in Heterogeneous Contexts: Dataset Collection and the Role of Supervised Attention

Gong, Hongyu; Valido, Alberto; Ingram, Katherine; Fanti, Giulia; Bhat, Suma; Espelage, D. (May 2021, Proceedings of the AAAI Conference on Artificial Intelligence)
null (Ed.)
Abusive language is a massive problem in online social platforms. Existing abusive language detection techniques are particularly ill-suited to comments containing heterogeneous abusive language patterns, i.e., both abusive and non-abusive parts. This is due in part to the lack of datasets that explicitly annotate heterogeneity in abusive language. We tackle this challenge by providing an annotated dataset of abusive language in over 11,000 comments from YouTube. We account for heterogeneity in this dataset by separately annotating both the comment as a whole and the individual sentences that comprise each comment. We then propose an algorithm that uses a supervised attention mechanism to detect and categorize abusive content using multi-task learning. We empirically demonstrate the challenges of using traditional techniques on heterogeneous content and the comparative gains in performance of the proposed approach over state-of-the-art methods.
more » « less
Full Text Available
Self-Supervised Euphemism Detection and Identification for Content Moderation

https://doi.org/10.1109/SP40001.2021.00075

Zhu, Wanzheng; Gong, Hongyu; Bansal, Rohan; Weinberg, Zachary; Christin, Nicolas; Fanti, Giulia; Bhat, Suma (May 2021, Proceedings of the 2021 IEEE Symposium on Security and Privacy (SP))
null (Ed.)
Full Text Available

« Prev Next »

Search for: All records